Negatively Correlated Bandits∗
Authors
Abstract
We analyze a two-player game of strategic experimentation with two-armed bandits. Each player has to decide in continuous time whether to use a safe arm with a known payoff or a risky arm whose likelihood of delivering payoffs is initially unknown. The quality of the risky arms is perfectly negatively correlated between players. In marked contrast to the case where both risky arms are of the same type, we find that learning will be complete in any Markov perfect equilibrium if the stakes exceed a certain threshold, and that all equilibria are in cutoff strategies. For low stakes, the equilibrium is unique, symmetric, and coincides with the planner’s solution. For high stakes, the equilibrium is unique, symmetric, and tantamount to myopic behavior. For intermediate stakes, there is a continuum of equilibria.
∗We are grateful to Philippe Aghion, Patrick Bolton, Kalyan Chatterjee, Martin Cripps, Matthias Dewatripont, Jan Eeckhout, Florian Englmair, Eduardo Faingold, Philipp Kircher, George Mailath, Timofiy Mylovanov, Stephen Ryan, Klaus Schmidt, Larry Samuelson, as well as seminar participants at Bonn, Munich, UPenn, Yale, HEC Paris, the 2007 SFB/TR 15 Summer School in Bronnbach, the 2007 SFB/TR 15 Workshop for Young Researchers in Bonn, the 2008 SFB/TR 15 Meeting in Gummersbach, the 2008 North American Summer Meetings of the Econometric Society, the 2008 Meeting of the Society for Economic Dynamics and the European Summer Symposium in Economic Theory (ESSET) 2008 for helpful comments and suggestions. We thank the Department of Economics at the University of Bonn, the Business and Public Policy Group at the Wharton School, the UPenn Economics Department and the Studienzentrum Gerzensee for their hospitality. Financial support from the Deutsche Forschungsgemeinschaft through SFB/TR 15 and GRK 801 is gratefully acknowledged.
†Munich Graduate School of Economics, Kaulbachstr. 45, D-80539 Munich, Germany; email: [email protected].
‡Department of Economics, University of Munich, Kaulbachstr. 45, D-80539 Munich, Germany; email: [email protected].
Similar resources
Strategic Learning in Teams
This paper analyzes a two-player game of strategic experimentation with three-armed exponential bandits in continuous time. Players face replica bandits, with one arm that is safe in that it generates a known payoff, whereas the likelihood of the risky arms’ yielding a positive payoff is initially unknown. It is common knowledge that the types of the two risky arms are perfectly negatively corr...
Global Bandits
Standard multi-armed bandits model decision problems in which the consequences of each action choice are unknown and independent of each other. But in a wide variety of decision problems – from drug dosage to dynamic pricing – the consequences (rewards) of different actions are correlated, so that selecting one action provides information about the consequences (rewards) of other actions as wel...
Asymptotic optimal control of multi-class restless bandits
We study the asymptotic optimal control of multi-class restless bandits. A restless bandit is a controllable process whose state evolution depends on whether or not the bandit is made active. The aim is to find a control that determines at each decision epoch which bandits to make active in order to minimize the overall average cost associated to the states the bandits are in. Sinc...
Resourceful Contextual Bandits
We study contextual bandits with ancillary constraints on resources, which are common in real-world applications such as choosing ads or dynamic pricing of items. We design the first algorithm for solving these problems that improves over a trivial reduction to the non-contextual case. We consider very general settings for both contextual bandits (arbitrary policy sets, Dudik et al. (2011)) and ...
Effects of Different Water Stress on Photosynthesis and Chlorophyll Content of Elaeagnus rhamnoides. Hamid Ahani, Hamid Jalilvand, Jamil Vaezi and Seyed Ehsan Sadati
We studied the response of Elaeagnus rhamnoides (Sea Buckthorn) to drought stress in a nursery. Photosynthesis and chlorophyll content changed only modestly under drought. Growth and physiological responses to drought were compared among four treatments of Sea Buckthorn seedlings grown from seeds of Qazvin provenance in the city of Mashhad, Iran. The experimental design in...